
SEQUENCE LISTIN(/ 



U MT^GMt'^^M onto K . 
E'ELLA-CIOPPA, Guy R. 
ERV;iN, Rcberc 1. 
KcGEE, David R. 

<12C> METHOD OF DETERMINING THE PRESENCE CF A 

TRAIT IN A PLANT BY TRANSFECTING A NUCLEIC ACID SEQUEi:CE OF 
A NON-PLANT LONOR INTO A HOST PLANT IN A POSITIVE 
ORIENTATION 



<13C> 003010137US07 

<14C> 09/359,3:0 
•.;14 1> 1999-07-2 1 

<16C> 60 

<170> FastSEQ for VJindows Version 3.0 

<2 1C> 1 
<2 11> 2 6 
<212> DNA 

■■:2 13> Tomato mosaic virus 



<400> : 

ctcgcaaagt ttcgaaccaa atcctc 



26 



<2 1C> 2 
<211> 35 
<2 12> DNA 

<:2 13> Tomato mc'saic virus 



<4 0C> 2 

cggggtacct gggccccaac cgggggttcc ggggg 



35 



< 2 1 0 > 3 

< 2 1 1 > 4 1 
<212> DNA 

<2 13> Tomato mosaic virus 



< 4 D 0 > 3 

tcctcgagcc taggctcgca aagtttcgaa ccaaatcctc a 



41 



<2 10:- 4 

< 2 1 1 > 3 5 

<2 12:> DNA 

<2 1j^> Tomato mC'Saic 



us 



< 4 D Ci - 4 

rggggtacct ggg::cccaar cgggggttcc ggggg 



35 



:210> 5 



1 1() 



<21.3:- ToT.ato m:>saic virus 

tacgtaiggt gragaagaar agat 

<21j> 6 

•;2i::- 24 

<2i:> 2)aA 

<2iy.- TiT.atC' m::saic virus 

agtcgactct tr-rtcttct^ gcat 

<2 10:- 7 

< 2 1 : > }• C 

<212;- DI:A 

-:21j.> TC'matC' mi'saic virus 



<400> 7 

tgctcgagtq tgttcttcag ttttctgtca 

<210> S 

< 2 1 1 > 3 0 

<2i2> d::a 

<21j> Tcmato mosaic virus 

< 4 0 (3 > o 

aactcgagcG ctttgatttc tccgaagctt 

< 2 1 0 > ? 
..2i:> 114 
^^2 12> OKA 

<213> T-:matc mC'Saic virus 

<.:2 2 0 > 
<221> CIS 

<222> (28) ... ( 115) 

< 4 0 C: > 9 

gtttcaaata cgctcgaggt tttaaat atg tct gtt gcc ttg tta tgg gtt gtt 

Met Ser Val Ala Leu Leu Trp Val Val 
1 5 

tct cct tqr gac gtc ica aat ggg aca agt ttc atg gaa tea gtc egg 
Ser Pro Cys Asp Val Ser Asn Gly Thr Ser Phe Met Glu Ser Val Arg 

10 15 20 25 

9^9 99^^ ^'■^^ 
Glu Gly Asu Arg 



•:2 10 



:2 13:^ TC'ir.atC' mosaic virus 



2 16 



<4dO. 10 

Met Ser Val Ala Leu Leu Trp Val Val Ser Pro Cys Asp Val Ser Asn 

1 5 10 15 

Gly Thr" 3er ?ne Met Glu Ser Val Arg Glu Gly Asn Arg 
2 2 5 

<2 10:-' 11 

<2 11,^ 3 9 

<2 12> DNA 

<2 13> Nicotiana benthamiana 



<4 0 0:- 11 

gcctcgagtg cagcatggaa acccttctaa agcttttcc 39 
<210> 12 

<2i:> 3b 

■ 2 1 2 - ■ D 1 J A 

<213> Nidotiana benthamiana 
<4 0 0 :- 12 

tccctaggtc aaaggctctc tattgctaga ttgccc 36 

< 2 1 0 > 13 
<211> 111 
.:212> DNA 

<213> Tobacco mosaic virus 

<22 0> 

<2 2 1:- CDS 



■ion. 



(25) . . . (Ill) 



< 4 0 Ci > 1 3 

gttttaaata cgctcgagtg cage atg gaa acc ctt eta aag cct ttt cca 51 

Met Glu Thr Leu Leu Lys Pro Phe Pro 
1 5 

tct cct tta ctt tec att cct act cct aac atg tat agt ttc aaa cac 99 
Ser Pro Leu Leu Ser lie Pro Thr Pro Asn Met Tyr Ser Phe Lys His 
10 15 20 25 

aacttiractttt 111 
Asn Phe Thr Phe 



v210> 14 
<2 11> 2 9 
<212> PRT 

<213> Tobacco mosaic virus 
<400> 14 

Ket Glu Thr Leu Leu Lys Pro Phe Pro Ser Pro Leu Leu Ser lie Pro 

15 10 15 

Thr Pro Asn Met T\'r Ser Phe Lys His Asn Phe Thr Phe 
2 0 2 5 

< 2 1 0 > 1 5 



1() 



<211> 44 
<212 :■ DNA 

< 2 1 3 . E r v; i n i a ne r b i c o 1 a 



<-.0 0.- 15 

rcaagcttirt ccagtgcagc atgcagcaac cgccgccgct tgac 



44 



< 2 1 0 ■ 16 

<2i:. 43 

<2 12.: DNA 

<2 13. ■ Erv.'inia nerbicola 



<4 0 0:-- 1 6 

aagatctcr: gagrtaaacg gga::g::tgcc aaagaccggc cgc 



43 



2 1 J ■ 1 7 

2 1 1 : 2 3 

2 12: L'MA 

2i:-: Tocacco mild green mosaic virus 



< 4 0 C' :: 1 7 

tgtgaaacr: gaaaaggttc egg 



23 



< 2 1 

< 2 1 1 
v2 12 

< 2 1 3 



■ 18 

■ 3 6 

■ e--::a 

Tobacco mild green mosaic virus 



^ ^ 0 : : 1 a 

caaaataccr. aaaccactac cqQcaqttaq aqqaqq 



36 



.2i:- 19 

.111:- 3 1 

;2i::- DNA 

^21:: P:.ograss mosaic virus 



.400> 19 

tactcgaqgt: tcataagacc gcggtaggcg g 



31 



:rass mosaic v i r u s 



cggggtacc: gggcccctac ccggggttta gggagg 



36 



•:210: 21 

■.2 11- 10^ 

:L12:- TNA 

■.:2 13 : N . tabacum 



4 10 




gtttcaaata cgcrcgagcc atg get tec tea gtt ett tee tct gea gea gtt 53 

Met Ala Ser Ser Val Leu Ser Ser Ala Ala Val 
1 5 10 

gee aer cgc age aat gtt get eaa get aac atg gtt gea eet ttc act 101 
A.la Thr Arg Ser Asn Val Ala Gin Ala Asn Met Val Ala Pro Phe Thr 



g g c ett 
Gly Leu 



<210- 2 2 

<2 11> 2 9 

• : 2 1 2 ' P R 1 

■:2 1 3 ■ IJ . t abaeum 

<400> 22 

Met Ala Ser Ser Val Leu Ser Ser Ala Ala Val Ala Thr Arg Ser Asn 

15 10 15 

Val Ala Giri Ala Asn Met Val Ala Pro Phe Thr Gly Leu 
20 25 

<2 10:- 2 3 
<211r- 1543 
<212: DMA 

<2 13;' T-:)caeeo mild green mosaic virus 



<4 0 0- 
etcgaggc t z 
t tgteagata 
aegagree aa 
et t tgacg ag 
::tgagte ret 
tgggtgtttt 
gggtaae agt 
-gtaergagc 
t a t e e a 1 1 'g r 
aga tega ag z 
ataacgtgg t 
aaggtt t ::g a 
acagttt ::g 
gat atag a re 
tacaacatra 
gegatg::t t a 
gat ac tgtta 
1 1 1 e raga aa 

r ::a t ::ggc 1 1 
aggagr ra a a 
g r a g a g 1 1 :: g 
: g a t a g t g r a 
r c q c a a c cj t 



ata ^gac ::g : 
aaa ggt tgt t 
age tagtgae 
at c aa agact 
gtgtgat at t 
gggtgttgtt 
gagegtgatt 
egetgtaaag 
gga tgc caag 
egg atgg zaa 
tgt ta aaggr 
aggtgtggt t 
aaagaaaaag 
ggagagatae 
rgagc tcgaa 
aeragtgzgt 
gaeag ragt t 
eaggatac 
age :e 1 1 ega 
r c 3 a a g t .a t :: 
tt ragrtttt 
aggr 1 1 tatt 

aa r ^ t aaa : r 



ggraggegga 
taaagat t tg 
ttccttgctc 
gtetetatta 
gacttgttag 
ttcaccggtg 
gacaaacggc 
gacagaaggt 
egaaaaeegt 
eeeetaget e 
ttgagggaaa 
gaegat t: teg 
aaaaagaetg 
gc:ggte-tg 
t cagtac cag 
gtetgegttg 
etetaacett 
ggtgtatatt 
taetagaaat 
t a a t g :: a. a c a 
getgaacgag 
accttggget 
cteterartt 
a a. at r a cgc a 
1 1 aa rgt ^eg 
certaarrec 



gcgtttgttt 
ttttttgttt 
t taegaaaaa 
gtaet aagga 
tgaatgtgce 
aatggctggt 
ttgaaaattc 
tceagt tcaa 
ggeaggttca 
tagaggtggt 
aggtcatcgc 
t rgat t eggt 
gaggaaggga 
act ege taca 
tatttcgcag 
tcgcaatcgt 
ctgagtgcga 
aattcagcag 
aggat rattg 
caacgtgt tg 
rtet e : aaeg 
azzgzq ::cag 
a aat rg aaga 
atgt 1 1 tagg 
cgtggr gt at 
gggtagggge 



a r tgtagtat 
gactgagtcg 
ggaggaaatt 
rget atatct 
attagataag 
accggatttc 
eagagagtgc 
gerggtteca 
tgtg cgaa 1 1 
ttctgttg — 
agtgaatgat 
tgctgcat t z 
tg taaa taa t 
aea t aaaga a 
cgatgtgggc 
ateaaaetea 
ccgtgaeac e 
1 1 c t a a a a c c 
aaaetgaaga 
atga tgcga :: 
gacatggt zz 
eta rat agg r 
g a t a t a e 1 1 a 
ttccatttaa 
a 1 1 a c a g t g g 
zza 



aattaaatat 
ataatgtct t 
t tacc caagg 
gttaaggagt 
tataggtatg 
gttaaaggtg 
ataat tggta 
aattacttcg 
caaa at etga 
atggt t a r ta 
rcgaaegteg 
aagg rga t tg 
aat aagt ata 
gaaaatggt t 
agag rc r aca 
ggeggcaaga 
gaaeeagcgg 
gt tgtaegag 
agagtcgcgt 
cgtggcrat t 
gatgaa ragg 
gtgg t gea z a 
rggt gt aat t 
a t c g a a a c r t 
a a a t a a e t a a 



12 
IS 
24 
3u 
3t 
42 

4e 

54 

6 0 
6 6 



8^ 
9 0 
96 
102 
106 
114 



5 16 



55 

rape mosaic virus 



: a r o c C : 



raatacgact cactaragtt ttatttttgt tgcaacaaca acaar 



55 



:2 1 : 

■ n 1 -■ 



:tt3tgcc:;r. tcatgacgag ctatatcacg 



30 



:■: 2-5 

2 I 1 . 4 9 7 

<212:. D:JA 

-:2 13> rape mcisaic virus 



rcttaaiarg 
caacaaaaca 
aac at tgc ag 
ttatgacaat 
caaatcagtg 
ttccctttac 
agttagag t a 
acr t tgc age 
atgt acgtga 



acvca rtata 
aac Hcaaaca 
gctgccgcag 
gctgtcgagg 
tctacggaac 
tcatacccaa 
tct catgatg 
gcaccttt tc 
tat age t 



gttt tatttt 
acaacaacat 
ggcgcaacag 
agctaaatgc 
atgacgctgt 
catgccgtac 
caagttccgt 
aaaggacgcg 



t gttgcaaca 
ggcacaattt 
cctggtgaat 
acgctcgaga 
tagct tcaaa 
actcccttgc 
tcggttctct 
actacgttca 



acaacaacaa 
caacaaa rag 
gatttagcct 
cgccctaagg 
cgcttatccg 
gggtggccta 
gacgtacgac 
ctgctgtatg 



at ta ::aat aa 
taaacatgca 
cacgacgtgt 
ttcattactc 
gagtttgaga 
aggactct tg 
atcggtggta 
ccaaact tgg 



60 
120 
180 
240 
300 
360 
420 
480 
497 





.:2 


1 " 








: 1. : 










-2 1.2 


■ CIJA 








■.213 


• rape 


mc s a i c 


vi rus 




DO 


2 " 






ga 






rgact ca 


ctatagtt 




^ 2 1 0 


2 c 








^: 2 1 1 


:■ 3:- 








.:2 12 


D!JA 












mc s a i c 


VI rus 




■ :4 : 0 


o ■ - 






at 


:gtttaaa 


c tggg 


rccct ac 


ccggggtt 




V 2 1 0 


2 










4 97 










■ 2i:a 








-.: 2 1 3 


• rape 


mosaic 


vi rus 














1 1 a a t a r g 


act c a 


r t a t a g t 


tttatttt 


ca 


ji c a a 3. a c a 


a a t a c 


a a a c a a e 


a a c a a c a t 


a a 


r a 1 1 c c a c 


gctgc 


7 g c a g g g 


cgcaacag 



55 



: g 1 1 g c a a c a a c a a c a a : 
iqcacaattt caacaaa: 



a a attacaataa 
ag taaacatgca 
ct cacQacatoc 



0 10 



ttacgacaat 


3 


::t3 


tcgagg 


agctaaatgc 


acgctcgaga 


cgccctaagg 


t tcattactc 


240 


caaatcagrg 






ggaac 


a ga rgct gtt 


agct t caaac 


gcttatccgg 


agt t tgagat 


3 00 






at a 


" ^ a a a 


catg:;cgtac 


artccrttgc 


gggtgc'ircta 


aggact c t tg 


3 60 


ag ccagajt^ h 






a t 3 a t g 


caa gt t c ::gt 


tcggttct =t 


ga cgtacgac 


at egg t ggt a 




actt t j^aqz 


3 


rac 


: 1 1 1 1 c 


aaaggacgcg 


actacgt tea 


ctgctgtatg 


ccaaacttgg 


4 80 


atgt: a rg tga 




at a 












4 97 



2 1 0 ^ 
<3 1 1 
< 2 1 2 
<213. 



55 
DNA 

race T.osaic virus 



•;^0 0 
gatgg::gr<:::. 

<2 1 0 

-:2 I I 



taataegact cactatagtt ttatttttgt tgcaacaaca acaac 
• 31 

di:a 

rape mosaic virus 



atcg: 



laa ::tgggcccct acccggggtt agggagg 

. 1 0 : 3 2 

.11: 2 5 

.12, C'NA 

;I3: Picia pas tor is 



55 



ttgractctg ttggctcatg acgat 



: 1 0 
■11 
■12 

: 1 .3 



33 
2 6 
DUA 

Pichia pastor is 



25 



< 4 0 0 
:aag c t tgca 

< 2 1 1 
<212 
<21 3 



caaacgaacg tctcac 

, 3 

4 2 
■ 21JA 

Pichia pastc>ris 



< 4 : 

;cgaqaq 



:ggctatt cccgaagaat ttgatattat eg 



2 1 1 
2 1 2 



J D 

2'NA 

Pirhia oast oris 



-4 2 2: 3 5 

tccrtaggtt agaatctagc aagaccggtc ttctcg 



" 16 



<2 12 ■ ArabidcDsis tha liana 



rcgagcygc:: gcat 



Trichosanthes kirilov/ii 



■.:4 37 

gc::tcgag::g C5g::atgatc agattctcag tcctctcttt gc 

■:2i:: 3S 

..:2 11:- 3 6 

■.:2 12:. E>:JA 

-:2 1}:- Trichosanthes kiril3v;ii 



tcrctaggcc aaaragcata acttccacat caaagc 



36 



< 2 1 0 

< 2 1 2 

< 2 1 3 



3 9 
109 

Lr:A 

Trichosanthes kirilo'.vii 



CDS 

(23). ..(109) 



^: 4 ■: : 3 9 

gttttaaata cgcccgagg atg ate aga ttc tta gtc etc tot ttg eta att 

Net lie Arg Phe Leu Val Leu Ser Leu Leu lie 

1 5 10 



52 



etc acc etc ttc eta aca act cct get gtg gag ggc gat gtt age ttc 
Leu Thr Leu Ph.e Leu Thr Thr Pro Ala Val Glu Gly Asp Val Ser Phe 
15 2 0 2 5 



100 



:gt tta tc.i 
\ra Leu Se]- 



10 9 



^:21';:' 4C 

< 2 1 1 : ■ 3 0 

<2 12:- FRT 

<2 13> Trichosanthes kirilov;ii 

eu I'al Leu Ser Leu Leu lie Leu Tr.r Leu Phe Leu 

5 10 15 

ai Glu Gly Asp \'al Ser Phe Arg Leu Ser 

2 5 3 0 




S 1() 



-.2 11'- 19 

■:212> PRT 

•: 2 1 3 . • ? yce 1 1 1 

■ : 4 L J . ■ 41 

Ser T\'r '/a.! Pro Ser Ala 31u Gin lie Leu Glu Phe Val L\-^3 Gin lie 

1 5 10 15 

Ser Ser 3 In 



■ : 2 1 0 : 4 2 

■.2:i:- S35 

■:2 12:- DNA 

•.:2i3; IJizctiana bent harr.i ana 



■.: ^ J . 

2 2 2 > ( 1 5 ) , . . ( 6 7 7 ) 
< 4 0 0 : 4 2 

cttcazztzc gicg atg get eta cct aac cag ::aa a-zz gtg gar tac cct 50 

Met Ala Leu Pro Asn Gin Gin Thr Val Asp Tyr Pro 

1 5 10 

age ttc aag etc gtt ate gtt gge gat gga ggc aea ggg aag ace aca 9 8 

Ser Pn^r Lys Leu Val lie Val Gly Asp Gly Gly Tn r Sly Lys Thr Tnr 

1 2 0 2 5 

ttt gta aag aga cat ctt aet gga gag ttt gag aag aag tae gaa ece 146 

Phe Val Lys Arg His Leu Thr Gly Glu Phe Glu Lys Lys Tyr Glu Pro 

3 3 5 4 3 

aet at*, ggt gtt gag gtt eat eet ctt gat tte tee a:t aae tgt gge 194 

Thr lie Gl'y Val Glu Val His Pro Leu Asp Phe Phe Thr Asn Gys Gly 

45 50 55 6 0 

aag ate :'gt tte tae tgt tgg gat act get ggc caa gag aaa ttt ggt 242 

Lys lie Arg Phe Tyr Sys Trp Asp Thr Ala Gly Gin Glu Lys Phe Gly 



ggt ctt agg gat ggt tac tac ate cat gga caa tgt get ate ate atg 2 90 

Gly Leu Arg Asp Gly T\'r Tyr lie His Gly Gin Cys Ala lie He Met 

8 0 8 5 9 3 

ttt gat gte aca gca cga etg aea tac aag aat gtt c::a aea tgg ea:; 3 38 

Phe Asj:.' \'a 1 Thr Ala Arg Leu Thr Tyr Lys Asn Val Pro Thr Trp His 

9 5 1 0 13 13 5 

egt gat ttt tge agg gtt tgt gaa aae ate eea att g:t ett tgt ggg 386 

Arg Asp Leu C\'S Arg \'al Gys Glu Asn He Pro lie l*a 1 Leu Gys Sly 

110 115 12 3 

aat aaa gtt gat gtg aag aae agg caa gte aag gcc aag eag gta aca 4 34 

Asn Lys \'al Asp \'a 1 Lys Asn Arg Gin Val Lys Ala Lys Gin Val Th. r 

12 5 13 0 13 5 14 3 

tte eae agg aag aag aae etc eag tat tae gag ata tct ger aag age 452 



10 



-he Kis Arg Lys Lys Asn Leu Gin lyr Tyr 3lu lie Ser Ala Lys Ser 



Asn T- 



14 5 



^s:: ?he 

16: 



150 
:g rac 



Pr::; Phe Leu 
16 5 



15: 



rtt. get aga aag ct:" acz 
Leu Ala Ara L/s Le ^ Ala 



399 ga: get aat ctt cac ret gtg gaa tea cct gcc ttt grt zee ccg 
3ly Asp Ala Asa Leu His ?ne Val Glu Ser Pro Ala Leu Ala Pro Pre- 
175 180 185 



578 



gaa gtt caa ate gac etg get get eag cag eag eat gag geg gag rtt 
Glu Val Oln lie Asp Leu Ala Ala Gin Gin Gin His 31u Ala Glu Leu 



1 9 : 



1 9'. 



626 



gea gra gea gea agt rag era ret eet gat gae gat gat gar ac r tte 
Ala Ala Ala Ala Ser Gin Pro Leu Pro Asp Asp Asp Asp Aep Thr Phe 



210 



y 0 ■ 



gag tagagaaaga gagatgtgat etgtcaetga ttaeccgtta gggcttgtct 
Glu 



727 



gaactttttt ttgtteatgg tgrtatcttt atgtgtctgt 
eattagcaat ttteattttt aagtctttaa ctgtegccat 



aetttgaaat gaatcgatga 787 
gaaagtgaaa ac 83 9 



•■ 2 1 0 
:2 1 1 
L 2 ' 

:: 2 1 3 



43 

22 1 
PRT 

Nicotiana benthamiana 



<400> 4 3 

Met Ala Leu Pro Asn Gin Gin Thr Val Asp Tyr 



1 



10 



/al lie Val Gly Asp Gly Gly Thr Gly Lys Thr 



^ 0 



25 



His Leu Thr Gly Glu Phe Glu Lys Lys Tyr Glu 
3 5 4 0 

His Pro Leu Asp Pne Phe Thr Asn 



;lu Wa:. 



Pro 
Tnr 
Pro 
Ovs Glv 



Tyr C\'s Tro Aso Thr Ala GL\' (3 In Glu Lys Phe 



Gly Tyr 
Ala Arg 
Arg \'a 1 



L45 

}lu L'.-s 



Tyr He His Gly Gin Gys Ala He He 

90 



35 

Thr Ty2 

100 



115 
Asn 



ys Asn Val Pro Thr Trp 
105 

jIu Asn lie Pre- lie \'al Leu Gys 
120 



Gly 
Met 
His 
Gly 



Arg Gin Val Lys Ala Lys Gin Val Tnr 
13 5 1 4 C 

: S e r 



. ^ e o e r a a 



150 



.a Arg 



155 
Leu 



165 



:e va . 



13 >- 



A.so Leu Ala 



Glu Ser 
Gin Gin Gin 



170 

Ala Leu Ala Pro 
185 

His Glu Ala Glu 



Pre 



Ger Phe Lys 
15 

Phe Val Lys 
3 0 

Thr He Gly 
45 

Lys I I e Arg 

Gly Leu Arg 

Phe Asp Val 
95 

Ai'g Asp Leu 
110 

Asn Lys \^a I 
125 

Pile His Arc 



Leu 

Arg 

Val 

Phe 

Asp' 
3 0 
Thr 

C\' s 

Asp' 

Lvs 



.-Asn Pne 



L s c ^ a 
175 

21 u Val Gin 
19 3 

Ala Ala Ala 



10 1() 



195 200 2C5 

?ro Leu Pro Asp Asp Asp Asp Asp Thr Phe 31u 



44 



Nicotiana benthamiana 



< 4 3 J 


. 44 












ctt r arttt 


grcgatggct 


c tacc t a acc 


agcaaa rcgt 


ggattacect 


agct t caag r 


60 


t cgt t ate g:; 


t g g tgat gga 


ggtacaggga 


agacca ratt 


tgtaaagaga 


ratcttaetg 


120 








1 1 aa t 21 t oa 


yd--' — ■ 


ettgattt rt 


1 e 0 


tcartaa rt g 


t ggraagatc 


cgt t tct act 


gttggatact 


getggr r aag 


agaaat 1 1 gg 




tggtrctagg 


gat ggt t a rt 


acatccatgg 


acaat gtgct 


ate at ratgt 


1 1 g a t g t c a r 


3 0 


agcacga rig 


a ratacaaga 


atgttetaac 


atggcaccgt 


gatctttgr a 


gggttttgtg 


3 6 :■■ 


aaa acat c r 


aattgttrtt 


tgt gggaata 


aagt t gatgt 


gaag a a r a gg 


eaagt r a a gg 


42 e 


ccaa-g rag gt 


a a c a 1 1 c t a r 


aggaagaaga 


ac etc c agta 


ttacgagata 


tct gecaaga 


4 to 


gcaactacaa 


ct tcgagaag 


ccattcttgt 


accttgctag 


aaagctcgc r 


ggggacgcta 


54 0 


atcttcartt 


tgtggaatca 


cctgcccttg 


ctccccrgga 


agt tcaa ate 


gacttggctg 


6 0 0 


■r tcag rag ra 


gratgaggcg 


gagcttgcag 


cagcagcaag 


tcagrca rtt 


cctgatgaeg 


6 6 0 


at gatga rac 


rtt-rgagtag 


agaaagagag 


atgtgatctg 


tcactgat ta 


crcgttaggg 


72 0 


rtcgtctgaa 


■rtttcttt 










738 


2 1 0 


. 4 5 












2 1 1. 


. '5 7 9 












■■; 2 12 


DNA 












-.; 2 1 3 


• Arabidops 


is t ha I i ana 










^: 4 D :;■ 


■ 45 












rttrartt': : 


gcrgatgg'-t 


c^a -'ctaacc 


ag raaa rcgt 


ggattacect 


agctt raage 


^: C' 


t c g *r *r a tr r g c 


tggrgatgga 


ggcacaggga 




^ \„ CI 1^ '-J CI '-j CI 


c a t c 1 1 a c t g 


1 2 0 


gagagrttga 


gaagaagtac 


gaacccacta 


ttggtgttga 


ggtt rateet 


cetgatttct 


1 8 0 


tca-r taa rtg 


t gg r a agate 


cgtt tct act 


gt tgggatac 


tgctggceaa 


gagaaatttg 


24 0 


gtggtcttag 


ggatggttar 


t a ratccatg 


gacaatgtgc 


tatcat ca tg 


tttgat gtea 


3 0 


cag racga-r c 


gacatac agg 


aatgttccaa 


catggcaccg 


tgat e 1 1 tgc 


agggtttgtg 


3 6 11 


rcaag-ragg : 


aacattc car 


aggaaggagg 


aactc cagta 


ttacgag ata 


tctgc caaga 


420 


gcaarca r aa 


rtt rgagaag 


ccattcttgt 


acct tg stag 


aaac r teg re 


ggggacgcta 


4 e 0 


atrtcrarti 


tgt ggaatca 


crtgcrcttg 


rtccrcrgga 


agtt raaa t c 


g a c 1 1 g g r t g 


5 4 0 


■rtragragra 


gcatgaggcg 


gagrttgcag 


cagcagcaag 


tcagc rac 1 1 


c rtgat ga rg 


6 J 'J 


atgatga ra-r 


cttcgagtag 


agaaagagag 


atgtgatctg 


t e a c t g a 1 1 a 


cecgt ta ggg 


6 6 0 


rttgt rtgaa 


rtttttttt 










6 7? 


< 2 I 2 


• 46 












<2 1 1 


6 6 7 












.:2 12 


• DXA 












.:2 1 3 


■■ N. tab a cum 










- ' 4 ^ " 


- 4 6 












a c g g r t c t a : 


c : a =1 r c a g r a 


aaecgtggat 


t a re r t age t 


tcaagrt rgt 


t at rgt t gg r 




gatggagg ra 


cagggaagac 


c a c a 1 1 1 g t a 


aagag a rat c 


t ta r tggaga 


g 1 1 tg a g a ag 




aagtatg a a r 


c r a r t at t gg 


t g 1 1 g aggt t 






taa rtgt gg r 






t r t a r t g 1 1 CI 


g g a r a r t g c t 


gg r ca a gag a 


a a 1 1 1 '-j g r gg 


t r 1 1 :a g g g a t 




ggttartara 


t r ra t gga ra 


a t g t g c t a t c 


at rat gt t tg 


a tgt e rage 


a r g a e t g a r a 


5 2 . 


t: a r a aga at g 


1 1 r c a a t a t g 


g rac rgtgat 


r 1 1 1 g r aggg 


1 1 tgt gaa aa 


r a t c e r a a 1 1 


3 6 0 


gttrtttgtg 


g g a a t a a a g t 


tgatgtgaag 


aa raggcaag 


t raagge r aa 


g raggtaae a 


4 2 


1 1 c r a r a g g a 


agaagaactt 


c ragtattac 


gagat at c tg 


c r aagag raa 


ctacaartte 


4 so 



1 1 10 



gagaag::cat rcttgtar::t tgctagaaag ctcgccgggg acgctaatct tcactttgtg 540 

gaatcarrtg crcttgct^r cccggaagtt caaatcgact cggrtgctra gcagcagcat 60D 

gaggcggagc ttgcagcag:: agraagtcag ccactt::ccg atgargatga tgacacrttr 660 

gag tag a 6'^'^ 



■:;2:i:.' 56 7 
<2 12.> DMA 
<2i3 ■ N. taba-ao; 



<4 0O> 4 7 

atggcrct a :: caaaccaaca aartgtagat tatccaagrt traag::ttgc aatcgtggg: 60 

gatggaggaa crgggaaaaz aactttrgtc aagaggratc tcacrggtga a::ttgagaag 120 

aaatargaa- ccactatrgg tgtggaggtt catccattag aittccccar aaattgcggg 130 

aaaacrrgct tctattgrtg ggatarcgct ggacaagaga agtrrggagg tcttcgggat 240 

ggrtactac.a tccatgggca atgcgcaatt atcacgtttg atgttacag:: ccgtctgacc 300 

tacaagaacg tti'Ctacgcg gcatcgagat ctctgcaggg tttgrgaaaa catc::c::atc 360 

gttccctgrg gaaacaaagt tgatgccaag aacaggcagg ttaaggcaaa gcaagtcacc 420 

ttccacagga agaaaaactc gcaataccat gagatctcag caaagagtaa ctacaacttt 480 

gagaagcctt ttccgtacct tgccagaaag cttgctgggg atgctaatct tcactttigtg 540 

gaatcacctg cacttgczcc ccctgaagta caaattgatt tagccgcaca gcaactgcat 600 

gaacaagagc tcttgcaagc cgctgcgcac gcacttccag atgacgatga tgaagctttt 660 

gaataga 667 



<210> 4S 

<2 1I:- 13 7 

<2 12> PRT 

<213> Tobacco 

<4 00> 4 c. 

Met Ala Leu Pro Asn 

1 5 

Val lie Val Gly Asp 
2C 

His Leu Thr Gly Glu 
3 5 

Glu Val Hi.s Pro Leu 
5 0 

Tyr Cys Trp Asp Thr 
65 

Gly Tyr Tyr lie His 
35 

Ala Arg Leu Thr Tyr 
IC 0 

Arg Val Cys Glu Asn 
11 5 

Val Lys Ask Arg Gin 
13 0 

<2 10- 4S- 

^:211:' 125 

<212: PF.T 

< 2 1 1 : Toca tco 



mosaic virus 



Gin Gin Thr Val Asp Tyr Pro Ser Phe Lys Leu 

10 15 
Gly Gly Thr Gly Lys Thr Thr Phe Val Lys Arg 

2 5 3 0 

Phe Glu Lys Lys Tyr GLu Pro Tnr lie Gly Val 

4 0 4 5 

Asp Phe Phe Thr Asn Cys Gly Lys lie Arg Phe 

55 60 
Ala Gly Gin Glu Lys Pne Gly Gly Leu Arg Asp 
7 0 7 5 8 0 

Gly Gin Cys Ala lie lie Met Phe Asp Val Thr 

90 95 
Lys Asn Val Pro Thr Trp His Arg Asp Leu Cys 

10 5 110 
rle Pro lie Val Leu Cys Gly Asn Lys Val Asp 

120 125 
Val Lys Ala Lys 
135 



Mosaic Virus 



Ala 



;ln Thr Val A£ 



Pr^ 



Ser Phe L\'s Leu 



12 10 



Val :le Val 31y Asp 3ly 31y Thr 31y Lys 

2 0 2 5 

His Leu Tnr 31y 31u Phe 31 u Lys Lys Tyr 

His Pro Lea Asp Phe Phe Thr Asn 
55 

?p Thr Ala 31y Gin Glu Lys 



Thr Thr 



. u Pro 



^rp 



Glu Val 
5 0 

Tyr Gys 
55 

Glv' Tyr -yr 

Ser Tnr Tnr 

Gly Leu Lys 
115 

Asn Arg Gin 

13 0; 



lie 



ICC 



Glv 31n 



S5 



rys Ala lie 
90 



^ys 

Phe 

75 

He 



Asp He Gin Glu Gys Ser Asn Met 



105 



HiH Ser Gin Leu Phe Phe Val 
120 

Val Lys Ala Gin 
135 



Gly 



31y 

X r 

31 y 
Met 
Ala 
He 



Phe 
Thr 
Lys 

Phe 

Pro 

Lys 
125 



V a 1 
3 2 
lie 



Asp 

Ser 
110 
Leu 



Lys 

' J ^ ^/ 

Arg 

Arg 

Val 

95 

Leu 

Met 



Arg 

Val 

Phe 

Asp 

80 

Thr 

Gin 

L'/s 



^21 'J > 
..211 

2 1 3 

.:22 3 
.■22 1 
. ■ o o ? 



5 0 
2 7 7 
GK'A 

Tobacco mosaic virus 



(1) . . . (2 77) 



■■;400> 
get act atg 
Ala Tnr Met 



5 0 

gtt 

Val 



gcc tct ccg get cag gcc act atg gtc get cct ttc 
Ala Ser Pro Ala Gin Ala Thr Mec Val Ala Pro Phe 
5 10 15 



48 



aac gga ctt aag tec tec get cct tec cag cca ccc gca agg eta aca 
Asn Gly Leu Lys Ser Ser Ala Pro Ser Gin Pro Pro Ala Arg Leu Thr 
20 25 30 



96 



acg aca tea ctt cca tea caa gca acg gcg gaa gag tta act gca tgc 
Thr Thr Leu Leu Pro Ser Gin Ala Thr Ala Glu Glu Leu Thr Ala Cys 
3 5 4 0 4 5 



144 



agg rgi 
Arg :y^: 



ggc etc cga ttg gaa aga aga agt ttg aga etc tct cct ace 
Gly Leu Arg Leu Glu Arg Arg Ser Leu Arg Leu Ser Leu Thr 
55 60 



19: 



1 1 r 
Phe 
65 



:ta acc tta c 



eg 
P r o 



at t 

lie 
70 



zcQ aat 
Pro Asn 



tgg eta 
Trp Leu 



agg aag ttg act 
Arg Lys Leu Thr 
^5 



acc 



tta 



30 



240 



tcr gca aca 
Ser Ala Thr 



act 
Ser 



gga 
Gly 
85 



ttc 
Phe 



:tt gtg 

^eu Val 



ttg aat 
Leu Asn 

90 



teg aag t 
Ser Lvs 



■;210> 51 
■:211> 9 2 



:csa 1 c 



13 10 



# 



;o ^ 5: 



1 



Arg C;/J 31y 
50 

?he Leu Tn:" 
Ser Al.-i Tnr 



.^a Gi: 



Lys Ser Ser Ala Fro Ser Glr Fro 

Leu Pr:> Ser Gin Ala Thr Ala Glu 
40 

Leu Ar5 Leu Glu Arg Arg Ser Leu 
55 

Leu Pro lie Pro Asn Tro Leu Aro 



70 

Ser Gly Phe 
35 



7 5 

\^al Leu Asn Ser 
90 



Pro Ala Arg Leu Tnr 

3 C 

Glu Leu Thr Ala Cys 

Arg Leu Ser Leu Tnr 
60 

Lys Leu Thr Thr Leu 
3 0 



2 10r- 5: 
2 1 1 > 1 •: 7 
2 12> OrLA 

2 1 }. r- Arabidopsis thaliana 



<400> 52 

actrgatcig tttcataota aaaccaaaac tcatgtttgr 

gcagraatca aaaatogtcc tataacaaaa ggaaatgcaa 

agtagcaggo aacattcctr ttcaotcgtc rtcttggcta 



tcactccaaa cacaaacaca 60 
caaaacagaa gaaacaacta 120 
cggagcc 167 



•:2 10- S': 
■:2 11> 3^-3 
<2 12r. TNA 

••:2 13:.- Arabidc'psis thaliana 
^:4 00> 5} 

gaaaogargt. cggctagrta ttgggcatgg cctgaccagc agcaacaaca tcacaatcat 60 
::atcagtcoa attgatcata ttgtctaaga acaacatcac actcatottg atarcattat 120 
ttatcai:raa aagaaaact:: cgtagatttt tctaataagc attttcaaat tatLtggcac 130 
gtttaaaatt aactaaattg ggttattatg tttacttgat ctgtttrata ctaaaaccaa 240 
aaggaaaaoc aaaacccatg tttgttcact ccaaacacaa acacagragt aatcaaaaat 300 
ogtottacaa caaaaaggaa atgcaacaaa acagaagaaa caactaagta gtaggcaaga 360 
ttct::::LLoa ccrgtcttct tggctacgga gcc 3 93 



: 1 j 
: 11 
! 12 
: 1 3 



P?.T 

Arabidopsis thaliana 



••:4 O'J:- 
Glu Thr Tnr 



Ser Al.H Ser T>'r Trp Ala Trp Pro 

5 10 

His His Gin Phe Asn 
1 ; 



Aso 



]ln Gin Gin 
1 1^ 



H':':r.o saDien£ 



^ a a o : 
raauc 



gsrgcatsag tgatgaagtg cgggaagggc aaactttgg: 
cacatrtsca tggcsaatts ccgccaaaac atcaggaagc 



tcgatscoaa 
tcatoaaaaa 



60 



14 



rggttr^atc atcaggaag^ caaccaagat tcactctrgt tccagagctr gcaaaatgaa 180 

gattg::caag acgaagggt:: gtcaccctgg atacggtaag aggaagggta ::ccgcgaagc 24 D 

taggrcg::ra acaaaggta:: tgtggatgcg taggatgrgt gttcttaggr gtct^ttgaa 303 

gaaataraga gagacgaaga agattgacaa gcacatgtac cangacatgr acatgcgcgt 300 

raagggtaat grgctcaaga ac:aag::gcgt cttgatggag agtat"ca::a agtcaaaggr 420 

ctagaagrta ggggagaa 43 3 



; 2 : 5 0 

■:2 1 1 ■ 4 33 

■:2 12 . 

•;2 13 ■ Ho:n:) sapiens 



-:43D.. 50 

gaa^aggrt: gc;c::tagtg tz2tzz^2tq tggcaagaag aaj^zzt^^z tagarc:::aa 0 j 

tgagazraat gaaatcgcra atgc:aartc ccgtcag:ag a^^cggaagr ccatcaaaga 12::) 

zq^^cz-j^zz atccgcaag:: ctgtgazggt zcattczcgg g::tcgatgcr ggaaaaacac IS-j 

itt 3 g r-L^ rgc ::ggaaggg::a ggcacatggg cataggtaag cggaagggca cagc::aatgc 24Ci 

ccgaatgcca gagaaggtca catggatgag gagaatgagg attttgcgcc ggctgcircag 300 

aagacaccgt gaatrtaaga agatcgatcg ccacatgtat caragcccgc acctgaaggt 360 

gaaggggaat gtgctcaaaa acaagcggat tctcatggaa cacatccaca agctgaaggc 420 

agaraaggcc ::g-::aagaa 4 OS 



2 1 0 : - 5 

; 2 1 I : ■ 4 J. 3 

<212:- D:JA 

■;2 13:' HDino sapiens 



^^4 00> 5^ 

gaag^ggctc gccgcatcag tgatgaagtg cgggaagggc aaagtttggc tcgatcccaa 60 

::gaaagct gacatctrra tggccaa^tc ccgccaaaac at^aggaagc rtgtgaagga 120 

cggr ~ t at 3 ar.^aggaag:: caaccaagat tcactcicgt cczagagrrc gcaaaatgaa IBO 

gattg:caag atgaagggt:: gtcactctgg atacggtaag aggaagggta cccgtgaagc 2 4'; 

taggttg::a acaaaggtat tgtggatgcg taggatgcgt gttcttaggc gtctgttgaa 3 0';) 

gaaatacaga gagacgaaga agattgacaa gcacatgtac catgacatgt acatgcgtgt 3 0l' 

taagggtaat gt^gttcaaga araagcgtgt cttgatggag agt at cca::a agtcaaaggc 

ttagaagrta ggggagaa 4 33 



-.2 10> 5c 

2 1 1 4 : 3 

-:212: D:JA 

■ : 2 1 3 H :)mc s ap lens 



•.4 00> SP 

gaagagg^t:: gcrtctagtg tcctccgctg tggcaagaag aaggtctggt taga:cccaa 6! 

tgagatjaat gaaatcgsta atgccaactc ccgtcagcag atrrggaag^ tcatcaaaga 12; 

tgggctgatt atccgraagt ctgtgarggt ccattctcgg gctrgatgt:: ggaa^ajic^c IS'. 

tttggtcc<gt rg'.gaagggta ggcaratggg cataggtaag rggaacggta cagctaatgc 24! 

ccgaatgcra gagaaggtca catggatgag gagaatgagg attttgcgcc ggctgctcag 30-: 

aagataccgt gaatctaaga agattgatcg ccacatgtat catagcctgt acctgaaggt 360 

gaaggggaat gtgttcaaaa acaagcggat tctcatggaa caratccaca agctgaaggc 420 

agacaagg:c rgraagaa 43 t 



.: 2 1 0 : ■ 5 - 



14 5 



Lys 

Lys 
65 



Arg 



Lys 
145 



<4 00> 5 9 
AT'J Leu Ala 



20 

lie Arg Lys 
35 

He Hi^: Ser 
50 

Gly Arg His 

Leu Pro Thr 

100 

H i Li Asp Ket 
HE 

Val Leu Met 
13 0 



A X a 
5 



Leu 

Arg 

Ser 

Lys 
85 

T - f o 

Tyr 
Glu 



Ser 7a 1 Met 



.ys Cys Gly Lys Gly Lys Val Trp 



icr Ser^ .-.sp ^^e ber .-.et .-.^a .-.sr. ; 

25 30 

Val Lys Asp 31y Phe He lie Arg Lys Pro Thr 

40 45 

Ser Arg Ala Arg Lys Met Lys He Ala Lys Met 

55 6 0 

Gly Tyr Gly Lys Arg Lys Gly Thr Arg Glu Ala 

7 0 7 5 8 0 

Val Leu Trp Met Arg Arg Met Arg Val Leu Arg 

90 95 

T;/r Arg 31u Tnr Lys Lys lie A.sp Lys His r^!et 

10 5 11 C' 

Met Arg Val Lys Gly Asn Val Phe Lys Asn Lys 

120 125 

Ser He His Lys Ser Lys Ala Lys Leu Gly Glu 
135 140 



210> 60 
211; 147 
2 12; PRT 
<2 13;' Homo sapiens 



Lys 
1 

Leu 
Gin 
Val 

Arg 

65 
Ala 

Arg 

Met 

Lys 

Arg 
145 



< 4 0 0 ;- 
Arg Leu 

Asp PrC' 

Gln He 
35 

Thr Va.i 

Lys Gly 

Arg Met 

Arg Leu 

Tyr Hi- 
lls 
Arg lie 
13 0 



60 

Ala Ser Ser Val 
5 

Asn Glu Thr Asn 
20 

Arg Lys Leu He 

His Ser Arg Ala 
55 

Arg His Met Gly 

70 

Pro Glu Lys Val 
85 

Leu Arg Arg Zyr 

100 

Ser Leu T\'r Leu 



Leu Met Glu His 
135 



jeu Arg 



Glu 

Lys 

40 

Arg 

He 

Thr 

Arg 

Lys 
120 
He 



He 

25 

Asp 

Cys 

Gly 

Trp 

Glu 

105 
Val 

His 



Cys Gly 
1 0 

Ala Ala 
Gly Leu 
Arg Lys 

Lys Arg 

75 

Met Arg 
9 0 

Ser Lys 
Lys Gly 
Lvs Leu 



Lys Lys 

Asn Ala 

He He 
45 

Asn Thr 
6 0 

Lys Gly 

Arg Met 

Lys He 

Asn Val 
125 
Lys Ala 
14 0 



Lys Val Trp 
15 

Asn Ser Arg 
3 0 

Arg Lys Pro 
Leu Ala Arg 

Thr Ala Asn 

80 

Ai"g He Leu 
95 

Asp' Arg His 
1 1 C 

Phe L\'s Asn 

Asp Lys Ala 



io 10 



